Picture for Guanghao Zhang

Guanghao Zhang

DynFrame: Adaptive Reasoning-Driven Multimodal Framework with Dynamic Frame Augmentation for Complex Video Understanding

Add code
May 26, 2026
Viaarxiv icon

Think When Needed: Adaptive Reasoning-Driven Multimodal Embeddings with a Dual-LoRA Architecture

Add code
May 14, 2026
Viaarxiv icon

XEmbodied: A Foundation Model with Enhanced Geometric and Physical Cues for Large-Scale Embodied Environments

Add code
Apr 20, 2026
Viaarxiv icon

DSI-Bench: A Benchmark for Dynamic Spatial Intelligence

Add code
Oct 21, 2025
Viaarxiv icon

CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation

Add code
Mar 07, 2025
Figure 1 for CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation
Figure 2 for CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation
Figure 3 for CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation
Figure 4 for CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation
Viaarxiv icon

MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation

Add code
Mar 03, 2025
Viaarxiv icon

CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers

Add code
Feb 10, 2025
Viaarxiv icon

Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning

Add code
Oct 15, 2024
Viaarxiv icon

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Add code
Aug 28, 2024
Viaarxiv icon

WPS-Dataset: A benchmark for wood plate segmentation in bark removal processing

Add code
Apr 17, 2024
Figure 1 for WPS-Dataset: A benchmark for wood plate segmentation in bark removal processing
Figure 2 for WPS-Dataset: A benchmark for wood plate segmentation in bark removal processing
Figure 3 for WPS-Dataset: A benchmark for wood plate segmentation in bark removal processing
Figure 4 for WPS-Dataset: A benchmark for wood plate segmentation in bark removal processing
Viaarxiv icon